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Abstract. We present a fast algorithm for the subset convolution problem: given functions / and 
g defined on the lattice of subsets of an n-element set N, compute their subset convolution f * g, 
defined for all S C N by 

(/*<?)(£) = £/cos(£\t). 

f***) ■ where addition and multiplication is carried out in an arbitrary ring. Via Mobius transform and 

inversion, our algorithm evaluates the subset convolution in 0(n 2 2") additions and multiplications, 
£Nj , substantially improving upon the straightforward 0(3") algorithm. Specifically, if the input func- 

tions have an integer range {—M,—M + 1, . . . , M}, their subset convolution over the ordinary 
sum-product ring can be computed in O* (2™ log M) time; the notation O* suppresses polyloga- 
y-^ , rithmic factors. Furthermore, using a standard embedding technique we can compute the subset 

' convolution over the max-sum or min-sum semiring in 0*(2 n M) time. 

t-H \ To demonstrate the applicability of fast subset convolution, we present the first 0*(2 k n 2 + nm) 

O^l ■ algorithm for the minimum Steiner tree problem in graphs with n vertices, k terminals, and m 

edges with bounded integer weights, improving upon the 0*(3 fe n + 2 k n 2 + nm) time bound of the 
classical Dreyfus- Wagner algorithm. We also discuss extensions to recent 0*(2 n )-time algorithms 
for covering and partitioning problems (Bjorklund and Husfeldt, FOCS 2006; Koivisto, FOCS 2006); 
using fast subset convolution we can, for example, find all fc-colorable induced subgraphs of a given 
r/3 , n- vertex graph in 0*(2 n ) time. 

l cj. 

■ 1- Introduction 

> ' 

' 1.1. Background and Main Result. Many hard computational problems admit a recursive 

solution via a convolution-like recursion step over the subsets of an n-element ground set N. More 
precisely, for every S C N, one computes the "solution" h(S) defined by 

^ : (l) h(S) = Y / f(T)g(S\T), 

O : TQS 

where f(T) and g(S\T) are previously computed solutions for the subproblems specified by T and 
S \T, and the arithmetic is carried out in an appropriate semiring; the most common examples 
. in applications being perhaps the integer sum-product ring and the integer max-sum semiring. 

Given / and g, a direct evaluation of h for all S C iV requires 0(3") semiring operations. To our 
knowledge, this is also the fastest known evaluation approach until the present work. 

In a first attempt to improve upon the direct evaluation, the convolution analogy suggests the 
natural approach to evaluate (PQ) as a product of some type of Fourier transforms of / and g via a fast 
Fourier transform (FFT) and its inverse — in general, this approach has proven to be spectacularly 
successful in domains ranging from signal processing to number theory; see [15] for a survey of 
generalized FFTs. For example, considering a slightly different convolution operation of the form 

(2) h'(S) = f{T)g{SAT) , where 5 AT = (S \ T) U (T \ S) , 

TCN 

the Fourier approach immediately yields an evaluation approach requiring 0{n2 n ) ring operations 
via the fast Fourier transform on Z?? , the elementary Abelian group of order 2 n . However, the con- 
volution ([T]) is "truncated" from T C N to T C S, which in effect renders the operation somewhat 
incompatible with group-theoretic Fourier transforms and associated "natural" convolution opera- 
tions performed over the entire group. A simple zero-padding trick allows one to evaluate ([I]) via 
the FFT on Zg (equivalently, the classical n-dimensional FFT with padding on each dimension), 
but this unfortunately does not improve upon the direct evaluation strategy. 
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A second attempt at analogy will prove to be more successful. Indeed, the truncation to T C 5 
and the sum over all T C S in ([1]) suggests a connection with the classical Mobius transform 
[1, 18, 23j on the lattice of subsets of N, which is further motivated by the fact that a fast algorithm 
is known for evaluating the Mobius transform and its inverse on the subset lattice. 

It turns out that — in analogy with the Fourier approach — the evaluation of (pQ) in 0(n 2 2 n ) ring 
operations can be achieved via a product ("convolution over rank") of "ranked" extensions of 
the classical Mobius transforms of / and g on the subset lattice, followed by a "ranked" Mobius 
inversion. This is the main result of this paper. 

Theorem 1. The subset convolution over an arbitrary ring can be evaluated in 0(n 2 2 n ) ring 
operations. 

Furthermore, considerable extensions and variations of the basic fast convolution operation are 
possible, which we expect to find applications beyond the ones we proceed to outline in what 
follows. 

1.2. Application to Specific Computational Problems. Besides the algebraic complexity we 
also study the implementation of the fast subset convolution algorithm on the ordinary sum-product 
ring of integers. The model of computation assumed in our analyses is the random access machine 
with the restriction that arithmetic operations (including comparison) are considered unit-time 
only for integers of constant size. To avoid cumbersome expressions in runtime bounds we may 
use the notation O* to hide polylogarithmic factors, that is, we may denote 0*(r) when we have 
0(rlog d r) for some constant al in the familiar Landau notation. 
Our main result easily implies the following. 

Theorem 2. The subset convolution over the integer sum-product ring can be computed in 
0*(2 n logM) time, provided that the range of the input functions is {— M, —M + 1, . . . , M}. 

Combinatorial optimization problems usually concern the max-sum or min-sum semiring. While 
our fast subset convolution algorithm does not directly apply to semirings where additive inverses 
need not exist, we can, fortunately, embed the integer max-sum (min-sum) semiring into the integer 
sum-product ring. 

Theorem 3. The subset convolution over the integer max-sum (min-sum) semiring can be com- 
puted in 0*(2 n M) time, provided that the range of the input functions is {— M, —M + 1, . . . , M}. 

As an illustrative application of fast subset convolution, we accelerate the classical Dreyfus- 
Wagner algorithm [6] for the minimum Steiner tree problem: given an undirected graph G = (V, E), 
a weight w(e) > for each edge e £ E, and a set of vertices K C V, find a minimum-weight subgraph 
H of G that connects the vertices in K. The Dreyfus-Wagner algorithm runs in 0*(3 k n+2 k n 2 +nm) 
time, where n = |V|, m = \E\, and k = \K\. We give the first 0*(2 k n 2 + nm)-time algorithm, 
provided that the edge weights are small integers. 

The Dreyfus- Wagner algorithm and its variants play a key role in solving various related prob- 
lems. For example, the Dreyfus- Wagner algorithm has recently been used as a subroutine in fixed 
parameter tractable algorithms for certain vertex cover problems [TT], as well as for near-perfect 
phylogenetic tree reconstruction [JJ. Regarding rectilinear Steiner trees (RSTs), Ganley [9] writes: 
"The algorithm of Dreyfus and Wagner is probably the most popular used to date for computing 
optimal RSTs in practice." Furthermore, other "hierarchical partitioning" algorithms similar to 
that of Dreyfus and Wagner seem to appear in the literature with no explicit connection to the 
minimum Steiner tree problem; we will consider in some detail a recent algorithm by Scott, Ideker, 
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Karp, and Sharan [20] for detecting signaling pathways in protein interaction networks. Our im- 
provement via fast subset convolution concerns all these variants and applications, subject to the 
constraint that edge weights can be represented by small integers. 

We also note that many classical graph partitioning problems [10] can be solved by counting all 
valid partitions via recursive application of subset convolution (over the integer sum-product ring) . 
Thus, the present technique can be seen as a generalization of the authors' previous work [31 Q3] 
based on inclusion-exclusion and applies to a wider family of partitioning problems. For example, 
we can now solve extended partitioning problems, such as finding all fc-colorable induced subgraphs 
of a given n-vertex graph, in 0*(2 n ) total time. 

1.3. Related Research and Discussion. Mobius transform and inversion play a central role 
in combinatorial theory, particularly in the theory of partially ordered sets, subset lattices being 
special cases [H 021 [23]. The fast Mobius transform and inversion algorithms on the subset lattice 
can be considered folklore; Kennes [13] gives a formal treatment, but the algorithm is, in essence, 
that of Yates [25] for multiplying a vector of size 2 n by a Kronecker product of n matrices of size 
2 x 2 in 0(n2 n ) operations. As far as we know, the connection of (ranked) Mobius inversion and 
subset convolution has not been studied until the present work. 

For partitioning problems, we (the first two authors and the last author, independently) recently 
found two mutually different inclusion-exclusion algorithms J3[ [T3] , which anticipated the results 
of the present paper. Yet, these earlier results, even when combined, do not immediately yield the 
fast subset convolution algorithm. What remained to be discovered was, in essence, the role of 
(fast) ranked Mobius inversion. 

For the minimum Steiner tree problem, Molle, Richter, and Rossmanith [16] have recently found 
an algorithm that, for any fixed e > 0, runs in 0*((2 + e) k p{n)) time, where p(n) is a polynomial 
function of n. Unfortunately, the degree of p(n) grows rapidly when e approaches zero, which renders 
the algorithm impractical for small e; in a subsequent work [7], the degree of p{n) is improved to 
12\/ e _1 hie -1 , resulting in bounds like 0*(2.5 fc n 14,2 ) and 0*(2.1 fc n 57 ' 6 ). Our accelerated Dreyfus- 
Wagner algorithm is not only theoretically faster, but may also have practical value when k is large 
enough (say, k > 25). 

The idea of embedding the integer max-sum or min-sum semiring into the sum-product ring is 
not new. Some well-known examples are Yuval's [26] and others' [HJ [21] [22] approaches to compute 
shortest paths via (fast) matrix multiplication. In our case the embedding technique provides a more 
substantial gain. Indeed, compared to fast subset convolution, fast matrix multiplication algorithms 
involve large constant factors and their practical value is not clear; the exponential speedup offered 
by fast matrix multiplication is currently by the ratio 3/2.376 [5] and cannot exceed 3/2, whereas 
the exponential speedup offered by fast subset convolution is by the ratio log 3/ log 2 > 1.58 > 3/2. 

1.4. Organization. The next section is devoted to proving our main theorem (Theorem [T]) ; in ad- 
dition, we introduce some variants of the subset convolution problem together with corresponding 
fast algorithms. In Section 3 we give short proofs of Theorems [2] and [3] concerning the implemen- 
tation in the sum-product ring and the max-sum and min-sum semirings, which are essential for 
the applications. We consider the minimum Steiner tree problem in detail in Section 4; also other 
applications to extended partitioning and hypergraph problems are illustrated, but in somewhat 
less detail. 

2. Fast Subset Convolution over a Ring 

Throughout this section we assume that R is an arbitrary (possibly noncommutative) ring and 
that N is a set of n elements, n > 0. 
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2.1. Subset Convolution. Let / be a function that associates with every subset S C N an 
element /(<5) of the ring R. For two such functions, / and g, define the convolution f * g for all 
S C iV by 

(3) (/* 5 )(S) = ^/(T) 5 (5\T), 

res' 

or, equivalently, in a more symmetric form 

(4) (f*g)(S)= Y, /(W)- 

U. V c s 
u uv = s 
unv = <D 

It follows that the convolution operation is associative (and commutative if R is commutative). 



2.2. Mobius Transform and Inversion on the Subset Lattice. We recall the classical Mobius 
transform and inversion formulas on the subset lattice together with their fast evaluation algorithms. 
Let / be a function that associates with every subset S C N an element f(S) of the ring R. The 
Mobius transform of / is the function / that associates with every X C N the ring element 

(5) f(X) = H S ) ■ 

sex 

Given the Mobius transform /, the original function / may be recovered via the Mobius inversion 
formula 

(6) f(S)= f(X). 

xcs 

The Mobius transform ([5j) can be computed in 0(n2 n ) ring operations, which constitutes the fast 
Mobius transform. By relabeling if necessary, we may assume that N = {1, 2, . . . , n}. To compute 
/ given /, let initially 

f (X) = f(X) 

for all ICJV, and then iterate for all j = 1, 2, . . . , n and X C N as follows: 



(7) fj(X) 



fj-i(X) xj?x, 
f j - 1 (X\{j}) + f j ^ 1 (x) ttjex. 



It is straightforward to verify by induction on j that this recurrence gives f n (X) = f(X) for all 
X C N in 0(n2 n ) ring operations. The inversion operation ([6]) can be implemented in a similar 
fashion. To compute / given /, let initially 

MS) = f(s) 

for all SON, and then iterate for all j = 1, 2, . . . , n and S C N as follows: 



(8) MS) - 

Then we have f n (S) = f(S) for all S C N. 



fj-i(S) ifj^S, 
-/ i _a(S'\{j}) + /i-i(5) ifieS. 
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2.3. Ranked Mobius Transform and Inversion. Let / be a function that associates with every 
subset S C TV" an element f(S) of the ring R. The ranked Mobius transform of / is the function / 
that associates with every k = 0, 1, . . . , n and X C N the ring element 

(9) f(k,X)= E 

sex 

S = fe 

In particular, the classical Mobius transform of / is obtained in terms of the ranked transform 

\x\ 

by taking the sum over k, that is, f{X) = ^[.Jq f(k, X). For the ranked transform, inversion is 
achieved simply by 

(10) f(S) = f(\S\,S), 
or, in a somewhat more redundant form, 

(11) f(s) = j2(-^ xl Hnx). 

xcs 

This latter expression, rather than the former one, provides the key to fast evaluation of the subset 
convolution (J3j) - Namely, we will "invert" a function that, in general, cannot be represented via 
ranked Mobius transform but via a convolution (over rank) of two such transforms. 

To set the stage, it is immediate that the ranked transform ([9]) can be computed in 0(n 2 2 n ) ring 
operations by carrying out the standard fast transform ([7]) independently for each k = 0, 1,... ,n. 
Similarly, the ranked inversion (jlip can be computed in 0(n 2 2 n ) ring operations by carrying out 
the standard fast inversion ([8]) independently for each k = 0, 1, . . . , n. 

2.4. Fast Subset Convolution. For two ranked Mobius transforms, / and g, define the convolu- 
tion / © g for all k = 0, 1, . . . , n and X C N by 

k 

(12) (/ © g)(k, X)=J2 /0'> X)g{k - j, X) . 

j=0 

Note that this convolution operation is over the rank parameter rather than over the subset pa- 
rameter. 

It now holds that the inversion operation (jlip applied to / © g gives f * g. Indeed, first observe 
by ([9]) and ([ED that for any S C N we have 



\s\ 

J2(-l) ] ^ X Kf®9)(\S\,X)=Y / (-l) |SVq E fU,X)g(k-j,X) 
xcs xcs j=o 

(13) \S\ 

= E(- 1 ) |S ^'E E mm- 

XCS j=0 U, V C X 

\U\ =0 
\V\ = \S\-j 

Because X ranges over all subsets of S, it follows that for any ordered pair (U, V) of subsets of S 
satisfying |J7| + \V\ = \S\, the term f(U)g(V) occurs in the sum with sign (— l)l s \^ exactly once 
for every X satisfying U U V C X C S. No other terms occur in the sum. Thus, collecting the 
terms associated with each pair (U, V) together, the coefficient of f(U)g(V) is 1 if U U V = S and 
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otherwise. Because \U\ + |V| = \S\ and U U V = S together imply U n V = 0, it follows that ([15]) 
and @ agree. In other words, 

(14) (f*g)(S)= 

Given / and 5, we can now evaluate f * g in 0{n 2 2 n ) ring operations by first computing the fast 
ranked Mobius transform of / and <?, then taking the convolution (|12p of the transforms / and 5, 
and inverting the result using fast ranked Mobius inversion. This establishes Theorem [T] 

2.5. Variants and Extensions. There are two immediate ways to relax the subset convolution 
@. First, the covering product is defined for all S C N by 

(15) (f* c g)(S)= Yl f( u )9(V). 

U, V C s 
uuv = s 

Second, the packing product is defined for all S C N by 

(16) (f* p g)(S)= f( U )9(V). 

t/.vcs 
un v = 

Given / and g, the covering product (fT5l) can be evaluated in 0(n2 n ) ring operations by com- 
puting the Mobius transforms / and g, taking the elementwise (Hadamard) product (fg)(X) = 
f{X)g{X) of the transforms, and inverting the result using fast Mobius inversion. Indeed, observe 
first th-Qjt 

E(- 1 ) ra (/5)w = E(- 1 ) |sxx| E mm. 

xcs xcs uycx 

Now, for each ordered pair (U,V) of subsets of S, the coefficient of the term f(U)g(V) is 1 if 
U U V = S and otherwise. Thus, 

(17) (f* e g){S)= £(-i) |SV *'(/$)(x)- 

xcs 

Given / and 5, the packing product f * p g can be evaluated in 0(n 2 2 n ) ring operations by first 
computing the subset convolution f * g and then convolving the result with the vector 1 with all 
entries equal to 1. Indeed, based on @ it is not difficult to check that 

f* p g = f*g*l = f*l*g = l*f*g. 

Besides the immediate extensions (|15p and f)16f) . also somewhat more subtle variants are possible. 
For example, using (|4|) and (|15p . define the intersecting covering product for all S C iV by 

(18) (/*ic<7)(S)= E /(W^)- 

y c s 
uuv = s 
u n v # 

A fast evaluation algorithm is now immediate from the observation / *j c g = f * c g — f * g. Also 
more precise control over the allowed intersection cardinalities \U H V\ = £ besides the i = (f * g) 
and I > (f*i c g) cases can be obtained by modifying fjl2[> : however, we will not enter into detailed 
discussion. Some further variations are possible by restricting the domain, e.g., to any hereditary 
family of subsets of N; we omit the details. 
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3. Model of Computation and The Choice of Ring 

Up to this point we have worked with an abstract ring R, and have considered only the number of 
ring operations (addition, subtraction, multiplication) required to carry out the computations. To 
arrive at a more accurate analysis of the required computational effort, we must choose a concrete 
ring R, fix a representation for its elements, and evaluate the required effort in a model that parallels 
the operation of an actual physical computer. In what follows, the model of computation is the 
random access machine with the restriction that arithmetic operations (including comparison) are 
considered unit-time only for constant-size integers. In this model, two 6-bit integers can be added, 
subtracted, and compared in 0(b) time, and multiplied in 0(b log b log log b) = 0*(b) time |19| . 

3.1. Integer Sum— Product Ring. We prove Theorem [2j We consider the subset convolution; 
similar argumentation applies to the other variants in !2.51 By Theorem [TJ we know that the subset 
convolution can be computed in 0(n 2 2 n ) ring operations. It is thus sufficient to notice that any 
intermediate results, for which ring operations are performed, are 0(nlogM)-bit integers. To 
see this, note first that the ranked Mobius transform of an input function can be computed with 
integers between —M2 n and M2 n . Given this we note that the convolution of ranked transforms 
can be computed with 0(n log M)-bit integers. Finally, the ranked Mobius inversion is computed 
by adding (and subtracting) 0(n log M)-bit integers 0(2 n ) times. 

3.2. Integer Max— Sum and Min Sum Semirings. We prove Theorem [3l We consider the 
case of max-sum semiring; similar argumentation applies to the min-sum semiring. Without loss 
of generality we assume that the range of the input functions is {0, 1, . . . , M}; otherwise, we may 
first add M to each value of both input functions, compute the convolution, and finally subtract 
2M to get the correct output. 

Let / and g be the two input functions. Let (3 = 2 n +l and M' = f3 M . Define new mappings /' and 
g' from the subsets of N to {0, 1, . . . , M'} by /' = f3* and g' = (i 9 . By Theorem[2]we can compute the 
subset convolution /' * g' over the integer sum-product ring in 0*{2 n logM') = 0*(2 n M) time. It 
remains to show that we can, for all S C N, efficiently deduce the value of max.Tcs{f (T) + g(S\T)} 
given the value of ^2 TQS f'(T)g'(S \ T). 

We observe that, for all S C N, we have a polynomial representation 

(/' * 9')(S) = £ 0f( T )+^ s \ T ) = a (S) + ai (S)(3 + ■■■ + a 2M (S)(3 2M , 

TCS 

where, due to the choice of (3, each coefficient a r (S) is uniquely determined and equals the number 
of subsets T of S for which f(T) + g(S \ T) = r. Thus, for each S C N, we can find the largest r 
for which a r (S) > in 0*(M) time. This completes the proof. 

4. Applications 

4.1. The Minimum Steiner Tree Problem. The Steiner tree problem is a classical NP-hard 
problem. Given an undirected graph G = (V, E), a weight w(e) > for each edge e £ E, and a set 
of vertices K C V, the task is to find a subgraph H of G that connects the vertices in K and has 
the minimum total weight ^2 e& E(H) w ( e ) among all such subgraphs of G. Because the edge weights 
are positive, an optimal subgraph H is necessarily a tree (a Steiner tree) with leaves in K. 

In accordance with the supposed model of computation, we require that the edge weights are 
integers from {1,2,..., M}. To simplify some expressions, we assume that M is a constant. 
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4.1.1. Dreyfus-Wagner Recursion. Dreyfus and Wagner [6| discovered a beautiful dynamic pro- 
gramming algorithm for finding a Steiner tree in 0*(3 k n + 2 k n 2 + nm) time, where n = \V\, 
m = \E\, and k = \K\. 

The key idea in the Dreyfus-Wagner algorithm is that a Steiner tree H connecting a given 
subset of vertices Y C V in G has the following optimal decomposition property, assuming \ Y\ > 3. 
For every q G Y, there exists a vertex p G V, a nonempty proper subset D C Y \ {q}, and a 
decomposition E(H) = E(Hi)UE(H2)UE(H^) such that (a) H\ is a Steiner tree connecting {p, q} 
in G, (b) i?2 is a Steiner tree connecting {p} U D in G, and (c) #3 is a Steiner tree connecting 
{p} U(y\(DU {(?})) in C (See [6j for a proof.) Note that the decomposition may be degenerate, 
e.g., we can have p = q, implying that H\ is empty. 

The optimal decomposition property enables the following Dreyfus-Wagner recursion. For a 
vertex subset Y OV, denote by W(Y) the total weight of a Steiner tree connecting Y in G. To set 
up the base case, observe that for \Y\ < 1 the weight W(Y) = and for \Y\ = 2 the weight W(Y) 
can be determined by a shortest-path computation based on the edge weights w(e). For \Y\ > 3 the 
optimal decomposition property implies that we have for all q G Y and X = Y \ {q} the recursion 

(19) W({q}UX) = mm{W({p,q})+g p {X) : p G V} , 



(20) g p (X) = min {W({p} U D) + W({p} U (X \ D)) : OcflCl}. 

The original problem can be solved by computing the weight W(K) via this recursion. A bottom- 
up evaluation of W(K) relying on dynamic programming takes the claimed 0*(3 k n + 2 k n 2 + nm) 
time; first all-pairs shortest paths are computed in 0*(n 2 + nm) time (in 0(n 2 log n + nm) basic 
operations) using, e.g., Johnson's algorithm [12J. Once the values VF({p} U Y) and g p (Y) for all 
Fci and p £ V have been computed and stored, an actual Steiner tree that achieves the optimal 
weight W(K) is easy to construct by tracing backwards a path of optimal choices in (fl9|) and ([20]) 
[6]; this costs only 0(2 k + kn) simple operations, that is, 0*{2 k logra + kn) time. 

4.1.2. Expediting the Dreyfus-Wagner Recursion. We apply the fast subset convolution over the 
min-sum semiring to expedite the evaluation of the Dreyfus- Wagner recursion in (I20p . However, 
we cannot simply replace (|20l) by fast subset convolution as each g p (X) is defined in terms of other 
values g r (Z), for Z C X and r £ V, which need to be precomputed. To this end, we carry out the 
computations in a level-wise manner. 

For each level £ = 2, 3, . . . , k — 1 in turn, assume the value VF({g} U X) has been computed and 
stored for all X C K with \X\ < I — 1 and q G V \ X. To compute g p (X) for each p G V and 
X C -K" with |X| = £ > 2, define the function f p for all X C K by 



(21) / P (X) 



^({pJUX) ifl<|X|<£-l, 
I oo otherwise . 



Here we let oo in (I21|) denote an integer that is sufficiently large to exceed the weight of any tree in 
G; for example, (n — 1)M + 1 suffices. Applying the subset convolution over the min-sum semiring, 
it is now immediate from (I20p and (I2ip that g p (X) = (f p * f p )(X) holds for all X C X with 
| Jf | < ^. Thus, by Theorem El we can compute g p (X) for all p G V and X CZ K with |A| = £ using 
n evaluations of the subset convolution with integers bounded by nM, which leads to 0*(2 k n 2 ) 
total time; note that the O* notation hides a factor of k 3 . In fact, we can do even better and 
save a factor of k by replacing the subset convolution with the covering product over the min-sum 
semiring. To see this, observe that because W(Z) < W(Y) holds whenever Z C Y C V, we have 
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that ([20]) can also be computed as g p (X) = (f p * c f p )(X), that is, 

g p (X) = mm{W({p}UT) +W({p}UU) : C T,U C X, T U U = X} . 

Once the values g p (X) have been computed, it is easy to compute W^dg} U X) for all X C K 
and q £ V \ X with \X\ = £ in 0*(Qn 2 ) time using (119j) . Computing the above steps for all 
levels £ = 2, 3, . . . , k — 1 takes O* (2 fc n 2 + nm) total time, including the time needed for computing 
all-pairs shortest paths. Finally, a Steiner tree can be constructed within the same time bound 
(see ^4.1. ip . We have thus established the following theorem, which we state in a form without the 
assumption that M is constant. 

Theorem 4. The minimum Steiner tree problem with edge weights in {1,2, . . . , M} can be solved 
in 0*(2 k n 2 M + nm log M) time. 

4.2. A Rooted Tree Model for Signaling Pathways. Scott, Ideker, Karp, and Sharan [20] 
consider various models for signaling pathways in protein interaction networks. One of the two 
more general models they introduce is based on rooted trees, and leads to the following network 
problem. Given an undirected graph G = (V, E), a weight w(e) for each edge e £ E, a vertex subset 
I C V, and a positive integer k, the task is to find for each vertex v £ V a tree of the minimum 
total weight among all /c-vertex subtrees in G that are rooted at v and in which every leaf belongs 
to I. 

Scott et al. [20] apply the color coding method of Alon, Yuster, and Zwick [2], which proceeds by 
carrying out a sequence of randomized trials. In each trial, every vertex v is given independently 
and uniformly at random a color c(v) £ {1, 2, . . . , k}, and the following subtask is solved: for each 
vertex v £ V and subset S C {1, . . . , k} that contains c(v), find a minimum-weight subtree with \S\ 
vertices that is (a) rooted at v, (b) contains a node of each color in S, and (c) in which every leaf 
belongs to /. Scott et al. give the following recurrence for the associated minimum weight, denoted 
by W(v,S): 

W(v, S) = min{A{v, S), B(v, S)} , 

where 

A(v, S) = min{ W{u, S \ {c(v)}) + w(u, v) : c(u) £ S\ {c{v)}} , 

B(v, S) = mm{W{v, T) + W(v, U) : TnU = {c(«)}, T U U = S} , 

with W(v, {c{v )}) = if v £ I and W(v, {c{v )}) = oo otherwise. A direct evaluation of this 
recurrence can be carried out in 0*(3 k m) time |20j . where m = \E\. 

Armed with fast subset convolution, we can speed up the evaluation of the recurrence to 0*(2 k m) 
time, assuming that the edge weights are small integers. Namely, proceeding simultaneously for 
all sets S of a given cardinality, the computation of B(v,S) can be reduced to subset convolution 
over the integer min-sum semiring; the transformation is analogous to the one used in £ 14.1,2] so 
we omit details. 

Scott et al. [20] also consider a different model based on two-terminal series-parallel graphs. In 
this case, too, the original (9*(3 fc n 2 ) algorithm can be accelerated to an 0*(2 k n 2 ) algorithm by 
using fast subset convolution. 

4.3. Partitioning Problems and Extensions. Consider the generic problem of partitioning 
an n-element set into k disjoint subsets that each satisfy some desired property specified by an 
indicator function / on the subsets of N. Given, N, k, and / as input, the task is to decide whether 
there exists a partition {Si, S%, . . . , Sk} of N such that f(S c ) = 1 for each c = 1, 2, . . . , k. Many 
classical graph partitioning problems are of this form. For example, in graph coloring f(S) = 1 if 
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and only if S is an independent set in the input graph with the vertices N. Likewise, in domatic 
partitioning / is the indicator of dominating sets. 

Recently we O [2] discovered two different algorithms that solve the generic partitioning problem 
using the principle of inclusion and exclusion in 0*(2 n ) time, provided that f(S) can be evaluated 
for all S C N in 0*(2 n ) total time. Using fast subset convolution we obtain yet another 0*(2 n ) 
algorithm. Indeed, we observe that the number of valid partitions of N is given by f* k (N), where 

r k = /*/*•••*/ . 

V v ' 

k times 

Thus, we can count the valid partitions by k — 1 subset convolutions, or even better, in 0(logA;) 
convolutions by using the doubling trick. 

What is more, we can solve considerable extensions of partitioning problems within the same 
runtime bound. For example, we can find a maximal &;-colorable induced subgraph (in fact, all such 
subgraphs) in 0*(2 n ) time by computing f* k (S) for all vertex subsets S C N. In a similar fashion, 
but using the packing product, we can decide whether the input graph G contains k disjoint cliques 
each of size at least £ in 0*(2 n ) time: we check if f* pk (N) > 0, where f(S) = 1 if S is a clique in 
G with \S\ > £, and f(S) = otherwise. 

Fast subset convolution allows us to solve not only flat partitioning problems but also hierarchical 
partitioning problems in 0*(2 n ) time. Consider, for example, a branching process that partitions 
the ground set N in a tree-structured manner, as follows. With probability a, a node S C N is 
split uniformly at random into two proper subsets T C S and S\T C S, which are then further 
partitioned recursively; with the remaining probability 1 — a, the branching terminates at S, and 
S becomes a leaf of the tree. With each possible leaf LCjVwe associate a number f(L), and by 
g(N) we denote the expected value of the product of f(L) over all leafs of the (random) tree. Then 
g(N) can be solved through a recursion for S C N: 

g (S) = (l-a)f(S)+a- w ±-^ 9(T)g(S\T). 

0CTC5 

Using fast subset convolution we can compute g(S) for all S C N in a total of 0*(2 n ) arithmetic 
operations. 

4.4. Spanning Problems in Hypergraphs. We conclude this section by illustrating more sub- 
tle applications to two NP-hard hypergraph problems (see [17\ I24j). We begin by recalling the 
appropriate hypergraph terminology. A hypergraph is a pair Ti = (V, £), where V is a finite set 
and £ is a set consisting of subsets of V. A hypergraph J = (W,J-) is a subhypergraph of Ti 
if W C V and T C £. A subhypergraph is spanning if V = W . A path in a hypergraph Ti 
is a sequence (xi, Ex, X2, E2, . . . , E^,x^ + i) such that (a) xi,x%,...,xi+i 6 V are all distinct, (b) 
Ex,E2, ■ ■ ■ , Ei G £ are all distinct, and (c) Xj, Xj+i G i£, for all i = 1, 2, . . . , i. A path joms x\ to 
s^+i. A hypergraph is connected if for all distinct x,y £ V there exists a path joining x to y. A 
connected hypergraph is a tree if for all distinct x, y £ V the path joining x to y is unique. 

The minimum connected spanning subhypergraph (MCSH) problem asks, given a hypergraph 
Ti = iy,£) and a weight w(E) > for each hyperedge E £ £ , to produce a connected spanning 
subhypergraph of that has the minimum total weight, or to assert that none exists. The minimum 
spanning tree (MSTH) problem is otherwise similar to the MCSH problem, but in addition it is 
required that the subhypergraph must be a tree. 

Assuming that w(E) £ {1,2, ... ,M} for all E £ £ , both the MCSH problem and the MSTH 
problem can be solved in time 0*(2 n M) using variants of the fast subset convolution over the 
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min-sum semiring, where n = \ V\. Indeed, define the function / for all E C V by 

I oo otherwise . 

To solve the MCSH problem, we employ the intersecting covering product (|18p . Define the /cth 
power of the intersecting covering product for all k = 2, 3, . . . by 

r icfc = /*ic(r ic(fc - 1} ), r icl = /. 

Here the order in which the products are evaluated is relevant because the intersecting covering 
product is not associative. Now observe that (a) a MCSH can be constructed by augmenting a 
connected subhypergraph of TC one hyperedge at a time, and (b) at most n — 1 hyperedges occur in 
a MCSH of H. Thus, f*' lck iV) < oo is the minimum weight of a connected spanning subhypergraph 
of TL consisting of k hyperedges. By storing the functions f* ick for each k = 1, 2, . . . , n — 1, the 
actual MCSH can be determined by tracing back the computation one edge at a time. To solve 
the MSTH problem, replace the intersecting covering product (I18p with an intersecting covering 
product that in addition requires the cardinality of the intersection to be exactly 1; such a product 
can be obtained by a minor modification of (| 1 2 j) . 
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